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Abstract 

We study the evolution of cooperation in populations 
where individuals play prisoner's dilemma on a net- 
work. Every node of the network corresponds on an 
individual choosing whether to cooperate or defect in 
a repeated game. The players revise their actions by 
imitating those neighbors who have higher payoffs. 

We show that when the interactions take place on 
graphs with large girth, cooperation is more likely to 
emerge. On the flip side, in graphs with many cycles 
of length 3 and 4, defection spreads more rapidly. 

One of the key ideas of our analysis is that 
our dynamics can be seen as a perturbation of the 
voter model. We write the transition kernel of the 
corresponding Markov chain in terms of the pairwise 
correlations in the voter model. We analyze the 
pairwise correlation and show that in graphs with 
relatively large girth, cooperators cluster and help 
each other. 

1 Introduction 

Prisoner's dilemma has become a known benchmark 
for studying the emergence of cooperation in popula- 
tions consisting of selfish agents. In this symmetric 
2-person game, each player has two strategies, coop- 
erate (C) or defect (D). A cooperator pays a cost of 
c, and it provides the benefit of b to the opponent. 
A defector incurs no cost and contributes no benefit. 
If the game is only played once, basic analysis of the 
static game shows that the unique Nash equilibrium 
is the defect-defect strategy. 

Despite this prediction, the evolution of coop- 
eration has been observed in populations such as 
genomes, multicellular organisms, and human soci- 
ety. Such an abundance of cooperation in settings 
similar to prisoner's dilemma has motivated an ex- 
tensive literature in game theory and evolutionary 



' Department of Electrical Engineering, Stanford University, 
Stanford, CA 94305. Email vahidehhOstanf ord . edu . 

t Department of Management Science and Engi- 
neering, Stanford University, Stanford, CA 94305. 
Email IsaberlOstanf ord. edul 



biology to explain the emergence of cooperation. For 
example, for the two player repeated game, the folk 
theorem implies that if the players are patient enough 
the cooperate-cooperate outcome is an equilibrium 
path of the infinite horizon game. This is also ex- 
tended to multi-player games, games of incomplete 
information, and noisy repeated games [Sj IH [5] . This 
question is also studied in evolutionary game theory 
and evolutionary biology [71 1^1 151 [T^ . 

In this paper, we study the evolution of cooper- 
ation in populations where each agent only interacts 
with a small part of the population. In particular, in- 
dividuals play prisoner's dilemma on a network. Ev- 
ery node of the network corresponds on an individual 
choosing whether to cooperate or defect in a repeated 
game. The payoff of a node increases with the number 
of its cooperator neighbors. The network structure is 
also crucial for the evolution of play: agents revise 
their actions by imitating those neighbors who have 
higher payoffs. This is similar to the class of Im- 
itation of Success dynamics studied in evolutionary 
game theory (see e.g. Sandholm 

We give the first rigorous analysis proving that, 
defined properly, locality of interactions increases 
the likelihood of the emergence of cooperation. In 
particular, if the underling network does not have 
any short cycles, the expected number of cooperators 
eventually exceeds its initial value. At the same time, 
we discover graphs with many cycles of length 3 or 
4, in which cooperation tends to decrease because 
of a "free-riding" effect. In these graphs, the set 
of cooperators is always surrounded by a set of 
defectors. Since defectors pay no cost, when they are 
well-connected to the set of cooperators, their payoff 
will be large. Hence the probability of imitating the 
defect action will increase. 

On a more technical side, our key idea in analyz- 
ing this dynamics is that it can be viewed as a per- 
turbation of the Voter Model (VM) [Tl[6]. We write 
the transition kernel of the Markov chain correspond- 
ing to our dynamics in terms of the pairwise corre- 
lations in the voter model. We analyze the pairwise 
correlations and show that in graphs with relatively 
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Table 1: Payoff of player i 



large girth, local clustering occurs. In other words, 
we show that when the girth of the graph is rela- 
tively large, the cooperators will cluster together and 
help each other. We also upper-bound the conver- 
gence time of our dynamics using techniques similar 
to those used to bound the convergence time of the 
voter model [2 [5] . 

The dynamics studied in our paper is closely 
related to that of [71 [9]. However, our approach differs 
in two essential ways: first, unlike [7l[9], our results 
are rigorously proved. More importantly, [9], focuses 
on the average degree of nodes, and ignores the role 
of small cycles. In our examples, we give graphs with 
short cycles that do not show the behavior predicted 
ini. 

2 Problem Definition and Preliminaries 

We are given a set of players V, with \V\ = n, that 
play prisoner's dilemma repeatedly at time steps t = 
0,1,2,3,.... The players interact on an undirected 
fc-regular connected graph G = {V, E) ; each player 
only plays with its neighbors. Each player i G V has 
two alternative strategies: cooperate (C) or defect 
(D). We associate variable XI S {0, 1} to each node 
i to represent its action at time t; XI — 1 {XI = 0) 
if player i cooperates (defects) at time t. The vector, 
Xt G {0, 1}" represents the configuration of the 
system at time t. The payoff matrix is a 2 x 2- 
matrix illustrated in Table [T] Note that the game 
is symmetric. The total payoff of player i & V sX 
time t, u\, is: 



~kcXi + h^X, 



(2.1) 



where j i means that j is a neighbor of i in G. 
Further, we assume that k{b + c) < 1. 

At each time step, one of the nodes is chosen 
uniformly at random to update its action. Let Al 
be the event that at time t, node i is chosen to 
update its action. If Al occurs and i alternates 
its strategy then all the other players update their 
payoffs. Node i updates its action according to the 
following mechanism: 



Weak Imitation of Success (WIS): In the WIS 

dynamics, the updating node i samples one of its 
neighbors and imitates its action. The sampling 
is slightly biased in favor of neighbors that have 
higher payoffs. In particular, node i first samples 
a selector, St, that is an independent Bernouli(e) 
random variable, where e is a small positive number. 
If 5*4 = then node i samples one of its neighbors 
uniformly at random and copies its action. On the 
other hand, if 5't = 1, node i will take a biased sample 
among its neighbors, where i favors neighbors with 
higher payoffs. Formally, 



i samples j \ Al,Xt, {St = 1} 



It is easy to check that for k{b + c) < 1, 

0<p(i samples j \ Al,Xt, {St = 1}) < 1, 

and, 

samples j \ Al,Xt, {St = 1}) = 1. 



Putting these two cases together, the probability that 
node i updates its action to C is: 



1 - e 



1 - 



(2.3) 



It is worth noting that when e = 0, the dynamics 
coincides with the VM [l]|6]. In the VM, the sampling 
of a neighbor is uniform and the updating node is 
more likely to imitate the strategy that the majority 
of its neighbors play, regardless of their payoffs. On 
the other hand, when e = 1, the sampling is based on 
the payoff of the PD games. We call this updating 
rule the PD dynamics. Note that for any < e < 1, 
the WIS is a mixture of VM and PD dynamics. 

It can be readily seen that WIS defines a Markov 
chain on {0, 1}" with two absorbing states; the all 
zero state, X ~ 0, and the all one state, X = 1. 
Since the graph size is finite, starting from any 
configuration, the chain reaches either or 1 in a 
finite time. We denote the probability that the chain 
starting from configuration Xq converges to the all 
one state, X = 1, by t^-^js i-e.. 



where /ij {X) is the measure defined by the Markov 

chain at time t starting from configuration Xq. In 
this paper, we mainly work with one fixed initial 
condition, thus hereafter, we drop the subscripts Xq. 

3 Main Theorem 

In this section, we present the main result of the 
paper. It states that under the WIS dynamics, 
when interactions are local and the graph does not 
have short cycles, and the benefit to cost ratio b/c 
is moderately large, then the expected number of 
cooperators increases. 

Theorem 3.1. Suppose graph G is a connected k- 
regular graph with girth at least 7. Further, suppose 
at time 0, a random pair of neighbors play C and the 
rest of the nodes play D; the system evolves according 
to the WIS afterwards. For any 7 > 0, e — n'~'^'^^^\ 
^/c > '^i^d n sufficiently large, the probability 

that the chain converges to the all one state, X — 1, 
is strictly larger than 2/n. More precisely, there exists 
a positive constant f that is bounded away from zero 
and 

2 e , 

T^wis > — I — /■ 
n n 

The constant f is increasing in the ratio b/c. 

Initially, the number of cooperators in the net- 
work is 2. Eventually, when the system converges, 
the expected number of cooperators is rnrwis- When 
e = 0, i.e., when we ignore the payoffs, the expected 
number of cooperators stays at 2. Our result shows 
that in our setting, when we slightly increase the e, 
we strictly increase the value of mr^js. This is an 
indication that cooperation has a higher evolutionary 
fitness on graphs with large girth. 

Consider an updating node i and two of its 
neighbors j and h. Suppose j cooperates and h 
defects. If the set of neighbors of j and h are 
the same, which may happen in graphs with cycles 
of length 4, then the payoff of h is always higher 
than payoff of j which results in the increase in 
the probability that i samples h. On other hand, 
suppose that the set of neighbors of j and h are 
disjoint and suppose we can show that as a result 
of clustering, in expectation, node j has one more 



cooperator neighbor than h does. In this case, the 
difference between payoff oi j and payoff of h is b — kc. 
Thus if b/c > k, then the payoff of j will be higher 
than the payoff of h implying than i is more likely 
to sample j. In fact, the condition on the ratio of 
6/c>fc(l + o(l))is needed even on an infinite tree. 

Note that for each i £ V, E [liuit^^ X^] = 
T^wis- Since random variable XI is bounded, Fatou's 
lemma implies that: 

limsupE \Xi] < TTwis < liminf E [X*] 

Since limsup of a sequence is at least equal to the 
liminf, the above inequalities imply that the sequence 
E [Xl] converges to nwis- Therefore, to compute 
the probability of convergence to the all one state, 
it suffices to analyze the evolution of the marginal 
probabilities. In particular, let pi be the probability 
that player i cooperates at time t. Then for each 
i G V, limt-^oo Pt = T^wis- Also, note that the 
expected number of cooperators converges to mrwjs. 
A major part of the proof of the above theorem 
studies the evolution of the marginal probabilities 
and the expected number of cooperators. 

Main Proof Ideas: 

The key idea in the analysis is that the WIS is 
a mixture of two dynamics: the voter model (VM) 
and the prisoner's dilemma (PD). At each time step, 
depending on the selector St, the updating player 
follows one of these dynamics; if S't = the player 
follows the VM, otherwise it follows the PD. 

One of the basic properties of the voter model on 
regular graphs is that given an arbitrary configura- 
tion, at each time step t, the expected drift in the 
number of cooperators is zero: In a regular graph, 
for any edge (i,j), the probability that i copies j is 
the same as the probability that j copies i. Suppose 
at time t, i plays C and j plays D. If i copies j, 
the number of cooperators decreases by one. On the 
other hand; if j copies i, the number of cooperators 
increases by one. Since the probability of these two 
events are the same, the expected drift along each 
edge is zero, implying that the expected drift is zero 
regardless of the configuration at time t. 

Let Ng denote the number of cooperators at time 
e, i.e., Ne = J2iev^e- First, we study E[Ne] and 
compute a lower-bound for it. Remember that in the 
WIS, at each time step t = 0, 1, . . . , 0—1, we first sam- 
ple a selector, St, that is an independent Bcrnouli(e) 
random variable, based on which we follow cither VM 
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: Illustration of Claim IXTl the expected number of cooperators increases when we apply the PD for 
time in step t*. 



or PD. Given the sequence 5'o, 5*1, . . . , Sg^i, consider 
the following three cases: 

Case 1: all the selectors Sq, Si, . . . , Se-i are 
zero: 

In this case the system has followed the VM process. 
Therefore, the expected number of cooperators at 
time 9 remains 2. 



Case 2: exactly one 
^o, 5*1, ... , Sg-i is one: 



of the selectors 



Suppose the selector at time t* is one. Since the 
system has followed the VM up to time t* , the 
expected number of cooperators at time t* is 2. 
However, after applying the PD at time step t*, the 
expected number of cooperators changes. We denote 
the expected drift by E[Aj*] (see Figure [T]). A key 
part of the analysis is to show that E [A^.] is positive, 
i.e., applying the PD for one period of time results in 
an increase in the expected number of cooperators. 
More precisely, 

Claim 3.1. Given graph G, suppose at time 0, a 
random pair of neighbors play G and the rest of 
the nodes play D; the system evolves according to 
the WIS afterwards. If h/c > P/{k - I), then 
< E[At.] < 1. 

The proof of this claim is presented in Section S) 
From t* + 1 to 9, the system again evolves according 
to the VM model, therefore in the time periods t* + 1 
to 9, the expected number of cooperators remains 



the same. 

Case 3: more than one of the selectors 
S'o, 5*1, . . . , Se-i are one: 

In this case, we just lower-bound the expected num- 
ber of cooperators by zero. 

Putting the three cases together, we have: 



, [Ng] > E 



Ngl i /\{St^ 0} 



Ngl I /\ {St = 0}A {St^ = 1} 
0-1 

= 2(1 - ef + e(l - ef-^ ^ (2 + E [A^.]) . (3.4) 



f=0 



Define, 



'5« = ^Ee[a, 



(3.5) 



t*=o 



We can rewrite [3^ as. 

E [Ng] > 2(1 - ef + [2 + Sg] 9e{l - ef, (3.6) 

Fimction g{e) = (1 — e)^ is convex for 9 > 2, therefore 
(1 — e)^ > 1 — e9. Substituting this in p.6p . we have: 

E [Ng] > 2-[2 + 6g] 9^e^ + 9Sge > 2 - 39^e^ + 95ge 

(3.7) 



where in the last inequahty, we used the fact that 
Sg < 1, which follows from Claim 13.11 Let T* be 
0.5(fc + 1)7i2+t/3, and let Ct- be the event that the 
chain converges by time T* , we have: 

E [Nt^] = E [Nt' I Ct*] P {Ct- ) + E [Nt* | Ct-] P (Ct- 
<E[Nt'\Ct*]P {Ct- ) + nP (Ct- ) (3.8) 

where in the last inequality we use that Nx* < n. 
Note that ttwis > ^E [iVr. | Ct-] P (Ct*)- Thus 
putting p.7p and p.Sp together, we have: 



2 - 3T* + T*5t-' e - nP (Ct- ) (3.9) 



Having inequality p.9p . the rest of the proof 
consists of establishing an upper-bound for P (Ct* ) 
in Lemma lB.ll and a lower-bound for T*5t-' in Lemma 



Lemma 3.1. Given graph G, suppose the system 
evolves according to the WIS . For e < n~'^, we have 



We prove this lemma by mapping our Markov 
chain into a death-birth chain and using some results 
on death-birth chain and the basic voter model. The 
proof is presented in Section [S] 

Lemma 3.2. Given graph G, suppose at time 0, a 
random pair of neighbors play C and the rest of the 
nodes play D; the system evolves according to the 
WIS afterwards. Ifb/c > -j^, then forn sufficiently 
large: 



The proof of this lemma is presented in Section [6l 
4 Proof of Claim 

In this section, we prove Claim 13.11 by studying the 
expected drift in the number of cooperators when 
we apply the PD dynamics for the first time at t* . 
Suppose node i is chosen to update its action at time 
t* . Substituting the payoff function (|2.ip into the 
sampling probability (|2.2p . we have: 



k - 1 



-kcXl,+bJ2xl. +1 



i E (-kcxii + bY^x^)i 



Simplifying the above expression results in: 

Xl,^, = 1| Al,,Xt',{St'=i}) = ^E^t- 
k-1 



fc2 



E 



-kcxi,+b E ^I'^i' 



E E 



-kcX^,Xl,+b E X't'X. 



Note that the cooperation probability of node i at 
time t* + 1 does not depend on its own action at time 
t* . Further, it is decreasing in the number cooperator 
neighbors, but it is increasing in the number of pairs 
of neighbors playing C and also in the number of 
C — C edges that are incident to neighbors of i. 
Given Xf , the expected drift in the number of 
cooperators, E [Aj*] in Figure [TJ is. 
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-(fc-i)cE^, 



(fc- 1)6 



E E ^I'^i' 
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E E E ^^^^^ 



(4.10) 



Taking expectation over all configurations, Xf G 



{0, 1}", we have: 

-y 

nk ^ 
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(4.11) 



where = E X^.X^, is the pairwise correlation 
of i and j at time t*, which is equal to the joint 
probability that nodes i and j play C at time t*. 
Since up until time t*, the system evolves according 
to the VM dynamics, we analyze the drift E [At*] by 
studying the evolution of the pairwise correlations in 
the voter model. 

Before proceeding with the analysis, to simplify 
the representation, we introduce the average correla- 
tions over all pairs in graph G and rewrite the drift 
E[At*] in terms of these average correlations. For a 
graph with girth at least 7, and for d < 4, let g^'^'' be 
the average joint probability of the two endpoints of 
any path of length d, i.e.. 



Qt 



(d) 



nk{k - ly- 



T E 

ieL{d) 



uivi 

Pt 1 



1 < d < 4 (4.12) 



where L{d) is the set of all length d paths, and ui and 
vi are the two endpoints of path I. Similarly, define 



It 



(0) 



n ^ — ^ 



(4.13) 



iev 



For a graph with girth at least 7, between any pair 
of nodes that appear in the RHS of (|4.1ip , there is a 
unique path of length at most 3. Thus, by rearranging 
the sums in (|4lT|) and using definition (|4.13|) , we can 
write E [Af] in terms of the average correlations: 



E[At.] = 
(fc-l)c 



(0) , (fc - , (2) (fc-]> (3) 



fcc 



-9t 



9r 



fcc 



-qi- 



(4.14) 



previous step in the following way: at time t — \, if 
neither i nor j is chosen to update its action, then the 
joint probability of i and j does not change. On the 
other hand, if i is chosen to update, then it will copy 
neighbor h with probability 1/fc. Therefore, the joint 
probability of i and j at time t will be the same as 
the joint probability of h and j in the previous step 
(time t — 1). Similarly, if j is chosen to update, we 
can write the joint probability of i and j in terms of 
the joint probability of i and neighbors of j at time 
t - 1: 



ij 

Pt 



(1 



nk ^-^ 

(4.15) 



In a graph with girth not smaller than 7, for each 
node i, the subgraph that consists of all the nodes at 
distance at most 3 from i is a fc-regular tree. Further, 
for 1 < d < 3, the average correlation g^'*-' is exactly 
the probability that at time t, a randomly selected 
pair of nodes at distance d paly C. Thus for 1 < d < 3 
and t < t*, the evolution of q^'^'' is similar to the 
evolution of the pairwise correlation in an infinite fc- 
regular tree. In particular, using (|4.15|) and (|4.13p . 
for 1 < d < 3 and t < t* . we have: 



(d-i) 



2(fc 



nk 



l)„(rf+i) 
«t-i > 



(4.16) 



Since at time 0, a random pair of neighbors 
play C, we have q^^ = 2/n, q^^^ ~ 2/{nk), and 
Qq'''' = 0, for d > 2. Starting from these natural 
initial conditions, a simple induction shows that: 



g^^)<,p)<,p)<,«, t<t*. (4.17) 

This monotonicity implies that E[At.] is non- 
decreasing in the ratio fe/c. Thus for 6/c> fc^/(fc — 1), 
we have: 



E[At.] > (fc-l)c 



-Qi 



(0) 



. kq^^ + qi? - kq^ 



4.1 Pairwise Correlations in the Voter Model 

Consider the nodes t and j. For t < t* , wc can write 

Pt"' in terms of the pairwise joint probabilities of the Using the recursive relation (j4.16p . we rewrite E [Ai*] 



as the following geometric sum: 
[k - l)c(l - ^)*' + fcgW + 



(2) 
% 



+ [k - l)c 



k - 1 



1'" 



+ ^9? ^ -{k- l)q. 



(4) 



(4.18) 



Having the initial conditions, it is easy to see that: 



(1) 



(2) 
% 



kqi^ = 0, 



Substituting the initial conditions into (|4.18|) . and 
using dUni), we have E [Aj.] > 0. 

To complete the proof of Claim 13.11 we need to 
show that E[Af.] < 1. This follows directly form 
the definition of E [Af]: the number of cooperators 
cannot increase by more than one after one step. 
Therefore, the expected drift at each time step is at 
most one. 

5 Proof of Lemma 13. II 

First we study the convergence time of our Markov 
chain. Let 9 be the convergence time, i.e., 9 is the 
earliest time that the system reaches either the all 
one {X = 1) or the all zero {X = 0) state. Note that 
our Markov chain is essentially a random walk on the 
cube {0, 1}". In order to compute 9 we can consider a 
slightly different random walk in which there is non- 
zero transition probabilities form state 1 (and 0) to its 
neighbors in the cube. It is easy to see that starting 
from any state in {0, 1}" \ {0, 1}, the time needed to 
hit or 1 is the same in these two chains. However, 
the modified chain is irreducible which makes the 
analysis easier 
1 starting form X and let T 
Clearly 9 <T. 

Next, we proceed to compute E[r]. We show 
that for e < n-^, the hitting time of WIS, E [T] < 
[k + l)n^/4. Note that for e = 0, our dynamics is the 
voter model. The hitting time of the voter model 
on a connected fc-regular graph is upper-bounded 
by kn^/A [TJ Chapter 14, Proposition 9]. In the 
following, we show that for e < n~^, the expected 
convergence time, E[T], is within an additive 0(1) 
factor of the hitting time of the basic voter model, 
which implies that E [T] < {k + l)n^/4. 

Define the mapping M : {0, 1}" — > {0, 1, . . . , n}, 
where A4{X) = X]"=i^'- words, we contract 



Let Tj^ be time needed to hit or 
'^^%e{o,i}'>\{o,i} 



all the states with the same number of cooperators. 
Note that if we apply this mapping to the modified 
irreducible Markov chain, the resulting chain will be 
a reversible death-birth chain. The following results 
for the reversible death-birth chain enable us to show 
that for e sufficiently small, the hitting time of WIS 
is almost the same as the hitting time for the basics 
voter model. 



5.1 Reviev^r of Some Results for Reversible 
Death-Birth Chains: Let tt^, < i < n, be 
the stationary distribution of the death-birth chain, 
Pj-i,j be the transition probability from state j — 1 
to state J, qjj^i be the transition probability from 
state j to state j — 1, and Ej_i [Tj] be the expected 
time to hit state j, starting from state j — 1. 

1. Using the reversibility of the chain, we can easily 
compute the stationary distribution tt^, < i < 
n in terms of the transition probabilities. In 
particular, given ttq. 



TTi = TTq 



3-1 

n 



Pz,z+1 



we can compute ttq by setting X]"=o '^i — ^■ 

2. Using the result of Palacios and Tetali [10], one 
can explicitly compute all the hitting times of 
such a chain. In particular. Theorem 2.3 of [10] 
states that: 



E,. 



1 



(5.19) 



Since the graph is a line, for any two states a 
and b, where < a < 6 < n: 



6-1 



(5.20) 



Having the above results for the reversible death- 
birth chain, we show that for e sufficiently small, 
the hitting time of WIS is almost the same as the 
hitting time of the basics voter model. The difference 
between each transition probability in the WIS and 



the voter model is at most e, therefore, the difference 
in stationary state probability of each state is at most 
O(n^e). From relations (|5.19|) and (|5.20p . it is evident 
that the difference in the hitting times in WIS and 
the voter model is at most 0(n'*e), which is 0(1) for 
e < n~^. 

Finally, note that we have chosen T* such that T* > 
2n^/^E[T]. We upper-bound the probability of event 
Ct* in the following way. By Markov inequality, 

P (T > 2E [T]) < i 

Since the modified chain is irreducible, it satisfies the 
memoryless property and we have: 

P (T > 4E [T] \{T > 2E [T]}) = P (T > 2E [T]) < ^ 

Repeating this argument results in: 

1 



(T > 2iE [T]) < 



2»' 



i > 1 



Thus for T* > 2n''^^E [T], we have, 

1 



< 



which completes the proof of the lemma. 



6 Proof of Lemma [ 

The proof of Lemma 13.21 is mainly algebraic and it 
amounts for obtaining a lower bound for the solution 
of a linear dynamical system. We write (|4.14p in the 
following matrix form: 



E[At^]^{k-l)cY^Qt, 



where 



Y 



-1 

(fc-l)b 
kc 

1 

(fc-l)b 
kc 



and Qt 



(0) 



It 



(1) 



(2) 



(3) 



Using the recursive relation ()4.16p . we have: 
2(fc- 1) 



(fc - l)cY' 



nk 



(6.21) 



where matrix A is: 
1 



A = 



_2_ I _ 2 2(fc-l 
nk n 

^ 








nk 

_ 2 2(fc-l) 

n nk 
2 1 2 

nk 



and vector Rs is 

4=gi4)[0 1]^ 

As mentioned before, for a graph with girth not 
smaller than 7, gi^-* < qi^\ for s < t*. Using this 
inequality and some tedious algebra, we show that: 





T At*-S-1 



Y'A 



i?, > Y' A 



T At* -s-1 







gfV(fc-l)/2 



J3) 



Substituting this inequality in (|6.2ip . we have: 



> (fc - l)cY^ 



s=0 



(6.22 



where vector is: 



Ws=q, 



(3) 



-^{k-l)/2 

1 



We can rewrite (|6.22p as: 

E[Ar] > {k ~ l)cY^ B'' Qo, 
where matrix B is given by. 



B = 



1 








2 

nk 


1-1 

n 


2(fc-l) 






2 

nk 




1-1 

n 

2 

nk 



2(fc-l) 



Summing over < T < T* - 1, 

T*-l 

^ E[A,.]>(fc-l)cf^ 
t*=o 



■T*-l 
.t*=0 



Qo 



The rest of the proof is algebraic; we compute 
the eigenvalue decomposition of matrix B and 
based on that, we establish the lower bound on 



7 WIS on Graphs with Small Girth 

In this section, we present two simple examples, 
complete graph and complete bipartite graph, that 
show why the expected number of cooperators does 
not increase in graphs with small girth. Because of 
the symmetry, we can compute the exact drift in these 
two examples. The main observation is that in these 
graphs, the local clustering of cooperators does not 
occur. For instance, in the complete bipartite graph, 
the set of neighbors of all the nodes in one side of 
the graph is always the same. Thus we have the free- 
riding effect that a defector receives a higher payoff, 
which results in the decrease in the probability of 
imitating cooperation. 

Proposition 7.1. Suppose graph G is the complete 
graph and suppose at time 0, x nodes play C and the 
rest of the nodes play D; the system evolves according 
to the WIS afterwards. For any 6, c > and e > 0, 
we have ttwis < 2/n. 

Proof. Suppose at time t, the configuration Xt has y 
cooperators. Using Relation (|4.10p . a simple counting 
shows that, 



n-2 
n{n — 1)' 



:[b+{n- l)c]y{n-y) 



that is non-positive for any < y < n. Thus 
E[Nt] < E[Nt-i], for all t > 0, which immediately 
implies that ttwis < 2/n. □ 

Proposition 7.2. Suppose graph G is the complete 
bipartite graph and n is even. Also, suppose at time 
0, X nodes play C and the rest of the nodes play D; 
the system evolves according to the WIS afterwards. 
For any &, c > and e > 0, we have nwis < 2/n 

Proof. Suppose at time t, the configuration Xt has 
yi and 7/2 cooperators in the two sides of the graph. 
Using Relation (|4.10|) . a simple counting shows that. 



E 



Atl Xt 



- [(n/2 ~yi + y2)yi{n/2 ~ 1/2) 
n 

+ {nl2-y2+yi)y2{nl2~yi)] 



that is non-positive for any < j/1,2/2 ^ n/2. Thus 
'K[Nt] < E[Nt-i], for all t > 0, which immediately 
implies that ttwis £ 2/n. □ 



dynamics is closely related to a dynamics of Nowak 
et al. [HI E]- In their work, they use the following 
updating rule: node i updates its action to C with 
a probability proportional to the fitness of its coop- 
erator neighbors. Fitness of node j is defined to be 
1 — e + euj, where e is a small positive number. In 
particular. 



E,ai^-e) + eul]Xl 



k{l 



..23) 



This dynamics models the mechanism that node i 
dies and its cooperator (defector) neighbors compete 
to replace i with a cooperator (defector) player in 
proportion to their fitness. When e ^ 1, the 
effect of the payoffs is quite small and the process is 
called Weak Selection. They show that in the Weak 
Selection regime, if 6 > fcc, the expected number of 
cooperators increases in the long run of the process. 
However, their analysis lacks rigor and ignores the 
effect of cycles and the correlation between nodes at 
distance more than one. 

As it is evident from relation (|8.23p . this is 
a nonlinear dynamics and its rigorous analysis is 
prohibitively difficult. However, note that algebraic 
manipulation results in having: 



0(6Vfc) 



which is the same as the transition kernel of the WIS 
process (relation 1^2.3^ ) up to an 0{e^/k) factor. 

8.2 Constant / in Theorem 13.11 We compute 
the constant / by putting Lemma 13.11 and 13.21 to- 
gether: 



^ ^ fc(fc-i)V(fc-i)/2 _ e 



fc2(fc- l)^(fc- l)/2 + A:'" k-1 
... ..s+T 

3/4(fc-^ l)^n" 



2n"'/3 



8 Discussion 



8.1 Relations to Evolutionary Dynamics of 
Novirak et al. As mentioned in Section [1] our WIS 
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